Parametric or Nonparametric? A Parametricness Index for Model Selection
Abstract
In the model selection literature, two classes of criteria perform well asymptotically in different situations. The Bayesian information criterion (BIC), as a representative of the first class, is consistent in selection when the true model is finite dimensional (the parametric scenario). Akaike’s information criterion (AIC) is asymptotically efficient when the true model is infinite dimensional (the nonparametric scenario). However, little work addresses whether, and how, one can detect which situation a specific model selection problem is in. In this work, we differentiate the two scenarios theoretically under some conditions. We develop a measure, the parametricness index (PI), to assess whether a model selected by a potentially consistent procedure can be practically treated as the true model; the PI also hints at whether AIC or BIC is better suited to the data when the goal is to estimate the regression function. A consequence is that, by switching between AIC and BIC based on the PI, the resulting regression estimator is simultaneously asymptotically efficient in both the parametric and nonparametric scenarios. In addition, we systematically investigate the behavior of the PI on simulated and real data and show its usefulness.
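To make the contrast between the two criteria concrete, the following is a minimal sketch of AIC and BIC applied to polynomial regression, assuming Gaussian errors. It is not the paper's PI and not the authors' code: the helper names (`solve`, `poly_rss`, `select_orders`), the simulated quadratic data, and the cap of order 5 are all assumptions chosen for illustration. It only shows that both criteria share the same goodness-of-fit term and differ in the penalty per coefficient (2 for AIC, log n for BIC).

```python
import math
import random

def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= f * M[c][j]
    z = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (M[i][n] - s) / M[i][i]
    return z

def poly_rss(x, y, order):
    """Residual sum of squares of the least-squares polynomial fit."""
    k = order + 1
    # Normal equations (X'X) beta = X'y for design columns 1, x, ..., x^order
    A = [[sum(xi ** (i + j) for xi in x) for j in range(k)] for i in range(k)]
    b = [sum(yi * xi ** i for xi, yi in zip(x, y)) for i in range(k)]
    beta = solve(A, b)
    return sum(
        (yi - sum(bj * xi ** j for j, bj in enumerate(beta))) ** 2
        for xi, yi in zip(x, y)
    )

def select_orders(x, y, max_order=5):
    """Return (order chosen by AIC, order chosen by BIC, all scores)."""
    n = len(y)
    scores = {}
    for d in range(1, max_order + 1):
        k = d + 1  # number of regression coefficients
        # Up to an additive constant, -2 * max log-likelihood = n * log(RSS / n)
        fit = n * math.log(poly_rss(x, y, d) / n)
        scores[d] = (fit + 2 * k, fit + math.log(n) * k)  # (AIC, BIC)
    aic_order = min(scores, key=lambda d: scores[d][0])
    bic_order = min(scores, key=lambda d: scores[d][1])
    return aic_order, bic_order, scores

# Hypothetical data: a quadratic truth, i.e. a "parametric scenario"
random.seed(0)
x = [random.uniform(-1, 1) for _ in range(200)]
y = [1 + 2 * xi - 3 * xi ** 2 + random.gauss(0, 0.5) for xi in x]
aic_order, bic_order, scores = select_orders(x, y)
print("AIC order:", aic_order, "BIC order:", bic_order)
```

Because the BIC penalty exceeds the AIC penalty whenever log n > 2, BIC never selects a larger model than AIC in this nested setting, which is one way to see why the two criteria behave differently in the two scenarios.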
Similar resources
To “Parametric or Nonparametric? A Parametricness Index for Model Selection”
BIC is used to select the order of polynomial regression between 1 and 30. The estimated σ from the selected model is used to calculate the PI. Representative scatterplots at n = 200 with σ1 = 3, σ2 = 7 can be found in Figure 1. Note that the function estimate based on the selected model by BIC is visually more different from that based on the smaller model with one fewer term for the parametri...
A comparison of parametric and non-parametric methods of standardized precipitation index (SPI) in drought monitoring (Case study: Gorganroud basin)
The Standardized Precipitation Index (SPI) is the most common index for drought monitoring. Although the calculation of this index is usually done by using the gamma distribution fitting of precipitation data, studies have shown that for accurate monitoring of drought, the optimal distribution of precipitation in each month should be determined. On the other hand, in non-stationary time series,...
Semiparametric regression models with additive nonparametric components and high dimensional parametric components
This paper concerns semiparametric regression models with additive nonparametric components and high dimensional parametric components under sparsity assumptions. To achieve simultaneous model selection for both nonparametric and parametric parts, we introduce a penalty that combines the adaptive empirical L2-norms of the nonparametric component functions and the SCAD penalty on the coefficient...
Covariate selection for semiparametric hazard function regression models
We study a flexible class of non-proportional hazard function regression models in which the influence of the covariates splits into the sum of a parametric part and a time-dependent nonparametric part. We develop a method of covariate selection for the parametric part by adjusting for the implicit fitting of the nonparametric part. Our approach is based on the general model selection methodolo...
Investigating the Factors Affecting Energy Consumption in the Iranian Agricultural Sector Using Parametric and Nonparametric Methods
To study energy consumption in Iran's agricultural sector, a genetic algorithm concept was used to identify significant factors affecting energy consumption between 1974 and 2008. Then, the durability or "stability" of the variables was assessed through an econometric method (the Augmented Dickey-Fuller test). In addition, long-term and short-term relationships of energy consumption were es...